

RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/325, 603 



DATE: 07/08/1999 
TIME: 11:11:34 



INPUT SET; S32473.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 



(1) 



General Information: 




(i) APPLICANT: Svendsen, Allan 

Bisgrd-Frantzen, Henrik 
Borchert, Torben Vedel 

(ii) TITLE OF INVENTION: A-Amylase Mutants 

(iii) NUMBER OF SEQUENCES : 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novo Nordisk of North America, Inc. 

(B) STREET: 405 Lexington Avenue, 64th Floor 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States of America 

(F) ZIP: 10174-6401 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/325,603 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/600,908 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Green, Reza 

(B) REGISTRATION NUMBER: 38,47 5 

(C) REFERENCE/ DOCKET NUMBER: 4394.204-US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212-867-0123 

(B) TELEFAX: 212-878-9655 



(2) INFORMATION FOR SEQ ID NO : 1 : 



PAGE: 2 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/325, 603 



DATE: 07/08/1999 
TIME: 11:11:34 



INPUT SET: S32473.raw 

47 

48 (i) SEQUENCE CHARACTERISTICS: 

49 (A) LENGTH: 1920 base pairs 

50 (B) TYPE: nucleic acid 

51 (C) STRANDEDNESS : single 

52 (D) TOPOLOGY: linear 
53 

54 (ii) MOLECULE TYPE: DNA (genomic) 

55 

56 (ix) FEATURE: 

57 (A) NAME/KEY: CDS 

58 (B) LOCATION: 334.. 1869 
59 

60 (ix) FEATURE: 

61 (A) NAME/KEY: sig_peptide 

62 (B) LOCATION: 334.. 420 
63 

64 (ix) FEATURE: 

65 (A) NAME /KEY : mat_peptide 

66 (B) LOCATION: 421.. 1869 
67 

68 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

69 

70 CGGAAGATTG GAAGTACAAA AATAAGCAAA AGATTGTCAA TCATGTCATG AGCCATGCGG 60 
71 

72 GAGACGGAAA AATCGTCTTA ATGCACGATA TTTATGCAAC GTTCGCAGAT GCTGCTGAAG 120 
73 

74 AGATTATTAA AAAGCTGAAA GCAAAAGGCT ATCAATTGGT AACTGTATCT CAGCTTGAAG 180 
75 

7 6 AAGTGAAGAA GCAGAGAGGC TATTGAATAA ATGAGTAGAA GCGCCATATC GGCGCTTTTC 240 
77 

7 8 TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAAAA TTCGGAATAT TTATACAACA 300 
79 

80 TCATATGTTT CACATTGAAA GGGGAGGAGA ATC ATG AAA CAA CAA AAA CGG CTT 354 

81 Met Lys Gin Gin Lys Arg Leu 

82 -29 -25 
83 

84 TAC GCC CGA TTG CTG ACG CTG TTA TTT GCG CTC ATC TTC TTG CTG CCT 4 02 

85 Tyr Ala Arg Leu Leu Thr Leu Leu Phe Ala Leu lie Phe Leu Leu Pro 

86 -20 -15 -10 
87 

88 CAT TCT GCA GCA GCG GCG GCA AAT CTT AAT GGG ACG CTG ATG CAG TAT . 4 50 

8 9 His Ser Ala Ala Ala Ala Ala Asn Leu Asn Gly Thr Leu Met Gin Tyr 
90-5 1 5 10 
91 

92 TTT GAA TGG TAC ATG CCC AAT GAC GGC CAA CAT TGG AGG CGT TTG CAA 4 98 

93 Phe Glu Trp Tyr Met Pro Asn Asp Gly Gin His Trp Arg Arg Leu Gin 

94 15 20 25 
95 

96 AAC GAC TCG GCA TAT TTG GCT GAA CAC GGT ATT ACT GCC GTC TGG ATT 54 6 

97 Asn Asp Ser Ala Tyr Leu Ala Glu His Gly lie Thr Ala Val Trp lie 

98 30 35 40 
99 
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PATENT APPLICATION US/09/325, 603 TIME: 1 1 : 1 1 :35 

INPUT SET: S32473.raw 

100 CCC CCG GCA TAT AAG GGA ACG AGC CAA GCG GAT GTG GGC TAC GGT GCT 594 

101 Pro Pro Ala Tyr Lys Gly Thr Ser Gin Ala Asp Val Gly Tyr Gly Ala 

102 45 50 55 
103 

104 TAC GAC CTT TAT GAT TTA GGG GAG TTT CAT CAA AAA GGG ACG GTT CGG 642 

105 Tyr Asp Leu Tyr Asp Leu Gly Glu Phe His Gin Lys Gly Thr Val Arg 

106 60 65 70 
107 

108 AC A AAG TAC GGC AC A AAA GGA GAG CTG CAA TCT GCG ATC AAA AGT CTT 6 90 

109 Thr Lys Tyr Gly Thr Lys Gly Glu Leu Gin Ser Ala lie Lys Ser Leu 

110 75 80 85 90 
111 

112 CAT TCC CGC GAC ATT AAC GTT TAC GGG GAT GTG GTC ATC AAC CAC AAA 7 38 

113 His Ser Arg Asp lie Asn Val Tyr Gly Asp Val Val lie Asn His Lys 

114 95 100 105 
115 

116 GGC GGC GCT GAT GCG ACC GAA GAT GTA ACC GCG GTT GAA GTC GAT CCC 786 

117 Gly Gly Ala Asp Ala Thr Glu Asp Val Thr Ala Val Glu Val Asp Pro 

118 110 115 120 
119 

120 GCT GAC CGC AAC CGC GTA ATT TCA GGA GAA CAC CTA ATT AAA GCC TGG 834 

121 Ala Asp Arg Asn Arg Val lie Ser Gly Glu His Leu lie Lys Ala Trp 

122 125 130 135 
123 

124 AC A CAT TTT CAT TTT CCG GGG CGC GGC AGC ACA TAC AGC GAT TTT AAA 882 

125 Thr His Phe His Phe Pro Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys 

126 140 145 150 
127 

128 TGG CAT TGG TAC CAT TTT GAC GGA ACC GAT TGG GAC GAG TCC CGA AAG 93 0 

129 Trp His Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys 

130 155 160 165 170 
131 

132 CTG AAC CGC ATC TAT AAG TTT CAA GGA AAG GCT TGG GAT TGG GAA GTT 97 8 

133 Leu Asn Arg lie Tyr Lys Phe Gin Gly Lys Ala Trp Asp Trp Glu Val 

134 175 180 185 
135 

136 TCC AAT GAA AAC GGC AAC TAT GAT TAT TTG ATG TAT GCC GAC ATC GAT 1026 

137 Ser Asn Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp lie Asp 

138 190 195 200 
139 

140 TAT GAC CAT CCT GAT GTC GCA GCA GAA ATT AAG AGA TGG GGC ACT TGG 1074 

141 Tyr Asp His Pro Asp Val Ala Ala Glu lie Lys Arg Trp Gly Thr Trp 

142 205 210 215 
143 

144 TAT GCC AAT GAA CTG CAA TTG GAC GGT TTC CGT CTT GAT GCT GTC AAA 1122 

145 Tyr Ala Asn Glu Leu Gin Leu Asp Gly Phe Arg Leu Asp Ala Val Lys 

146 220 225 230 
147 

148 CAC ATT AAA TTT TCT TTT TTG CGG GAT TGG GTT AAT CAT GTC AGG GAA 1170 

149 His lie Lys Phe Ser Phe Leu Arg Asp Trp Val Asn His Val Arg Glu 

150 235 240 245 250 
151 

152 AAA ACG GGG AAG GAA ATG TTT ACG GTA GCT GAA TAT TGG CAG AAT GAC 1218 
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INPUT SET: S32473.raw 

153 Lys Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Gin Asn Asp 

154 255 260 265 
155 

156 TTG GGC GCG CTG GAA AAC TAT TTG AAC AAA ACA AAT TTT AAT CAT TCA 1266 

157 Leu Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Phe Asn His Ser 

158 270 275 280 
159 

160 GTG TTT GAC GTG CCG CTT CAT TAT CAG TTC CAT GCT GCA TCG ACA CAG 1314 

161 Val Phe Asp Val Pro Leu His Tyr Gin Phe His Ala Ala Ser Thr Gin 

162 285 290 295 
163 

164 GGA GGC GGC TAT GAT ATG AGG AAA TTG CTG AAC GGT ACG GTC GTT TCC 1362 

165 Gly Gly Gly Tyr Asp Met Arg Lys Leu Leu Asn Gly Thr Val Val Ser 

166 300 305 310 
167 

168 AAG CAT CCG TTG AAA TCG GTT ACA TTT GTC GAT AAC CAT GAT ACA CAG 1410 

16 9 Lys His Pro Leu Lys Ser Val Thr Phe Val Asp Asn His Asp Thr Gin 
170 315 320 325 330 
171 

172 CCG GGG CAA TCG CTT GAG TCG ACT GTC CAA ACA TGG TTT AAG CCG CTT 1458 

173 Pro Gly Gin Ser Leu Glu Ser Thr Val Gin Thr Trp Phe Lys Pro Leu 

174 335 340 345 
175 

17 6 GCT TAC GCT TTT ATT CTC ACA AGG GAA TCT GGA TAC CCT CAG GTT TTC 1506 

177 Ala Tyr Ala Phe lie Leu Thr Arg Glu Ser Gly Tyr Pro Gin Val Phe 

178 350 355 360 
179 

180 TAC GGG GAT ATG TAC GGG ACG AAA GGA GAC TCC CAG CGC GAA ATT CCT 1554 

181 Tyr Gly Asp Met Tyr Gly Thr Lys Gly Asp Ser Gin Arg Glu lie Pro 

182 365 370 375 
183 

184 GCC TTG AAA CAC AAA ATT GAA CCG ATC TTA AAA GCG AGA AAA CAG TAT 1602 

185 Ala Leu Lys His Lys lie Glu Pro lie Leu Lys Ala Arg Lys Gin Tyr 

186 380 385 390 
187 

188 GCG TAC GGA GCA CAG CAT GAT TAT TTC GAC CAC CAT GAC ATT GTC GGC 16 50 

189 Ala Tyr Gly Ala Gin His Asp Tyr Phe Asp His His Asp lie Val Gly 

190 395 400 405 410 
191 

192 TGG ACA AGG GAA GGC GAC AGC TCG GTT GCA AAT TCA GGT TTG GCG GCA 1698 

193 Trp Thr Arg Glu Gly Asp Ser Ser Val Ala Asn Ser Gly Leu Ala Ala 

194 415 420 425 
195 

196 TTA ATA ACA GAC GGA CCC GGT GGG GCA AAG CGA ATG TAT GTC GGC CGG 1746 

197 Leu lie Thr Asp Gly Pro Gly Gly Ala Lys Arg Met Tyr Val Gly Arg 

198 430 435 440 
199 

200 CAA AAC GCC GGT GAG ACA TGG CAT GAC ATT ACC GGA AAC CGT TCG GAG 17 94 

201 Gin Asn Ala Gly Glu Thr Trp His Asp lie Thr Gly Asn Arg Ser Glu 

202 445 450 455 
203 

204 CCG GTT GTC ATC AAT TCG GAA GGC TGG GGA GAG TTT CAC GTA AAC GGC 184 2 

205 Pro Val Val lie Asn Ser Glu Gly Trp Gly Glu Phe His Val Asn Gly 
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RAW SEQUENCE LISTING DATE: 07/08/1999 

PATENT APPLICATION US/09/325, 603 TIME: 1 1 : 1 1 :35 

INPUT SET: S32473.raw 

460 465 470 

GGG TCG GTT TCA ATT TAT GTT CAA AG A TAGAAGAGCA GAGAGGACGG ' 188 9 
Gly Ser Val Ser lie Tyr Val Gin Arg 
475 480 

ATTTCCTGAA GGAAATCCGT TTTTTTATTT T 1920 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Gin Gin Lys Arg Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe 
-29 -25 -20 -15 

Ala Leu lie Phe Leu Leu Pro His Ser Ala Ala Ala Ala Ala Asn Leu 
-10 -5 1 

Asn Gly Thr Leu Met Gin Tyr Phe Glu Trp Tyr Met Pro Asn Asp Gly 
5 10 15 

Gin His Trp Arg Arg Leu Gin Asn Asp Ser Ala Tyr Leu Ala Glu His 
20 25 30 35 

Gly lie Thr Ala Val Trp lie Pro Pro Ala Tyr Lys Gly Thr Ser Gin 
40 45 50 

Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
55 60 65 

His Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Gly Glu Leu 
70 75 80 

Gin Ser Ala lie Lys Ser Leu His Ser Arg Asp lie Asn Val Tyr Gly 
85 90 95 

Asp Val Val lie Asn His Lys Gly Gly Ala Asp Ala Thr Glu Asp Val 
100 105 110 115 

Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val lie Ser Gly 
120 125 130 

Glu His Leu lie Lys Ala Trp Thr His Phe His Phe Pro Gly Arg Gly 
135 140 145 



SEQUENCE VERIFICATION REPORT DATE: 07/08/1999 

PATENT APPLICATION US/09/325, 603 TIME: 1 1 : 1 1 :36 



INPUT SET: S32473.raw 



Original Text 



